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DB=PGPB. USPT. USOC,EPAB,JPAB,DWPI; PLUR=YES; OP=ADJ 

n L4 L3 and activity 229 

□ L3 hepatoma derived growth factor 244 

DB=USPT; PLUR^YES; OP=ADJ 

n L2 Llandhdgf 42 

n Ll hepatoma derived growth factor 49 
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FILE 'MEDLINE, EMBASE, BIOSIS' ENTERED AT 10:40:06 ON 21 MAR 2007 

LI 173 S HDGF 

L2 31 S LI AND RAT 

L3 17 DUPLICATE REMOVE L2 (14 DUPLICATES REMOVED) 

L4 29052 S HRP 

L5 197 S HEPATOMA DERIVED GROWTH FACTOR 

L6 29 S L5 AND HUH-7 

L7 11 DUPLICATE REMOVE L6 (18 DUPLICATES REMOVED) 

FILE 'STNGUIDE' ENTERED AT 10:42:33 ON 21 MAR 2007 

FILE 'GENBANK* ENTERED AT 10:49:19 ON 21 MAR 2007 

L8 IS AY061636 

L9 IS BC085707 

LIO 1 S AF458587 

Lll 1 S AF458586 
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CHINA 


TANG. LISHA 
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CHINA 


ZHANG. PINGZHAO 


SHANGHAI 


CHINA 


DONG, YIMIN 


SHANGHAI 


CHINA 
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GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



November 22, 2006, 19:14:17 ; Search time 5625 Seconds 

(without alignments) 
11254.746 Million cell updates/sec 

US-10-501-053-1 
990 

1 cgctatgtcttgcttcagcc ttgggaaacccgctagggcc 990 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1 . 0 



Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 200000 0000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



12732272 



Database 



GenEmbl : * 



1 




gb_env : 




2 




gb_pat : 




3 




gb_ph : * 




4 




gb_pl : * 




5 




gb_pr : * 




6 




gb_ro : * 




7 




gb_sts: 


* 


8 




gb_sy : * 




9 




gb_un : * 




10 


gb_vi : 


* 


11 


gb_o V : 


★ 


12 


gb_htg 


: * 


13 


gb_i n : 


* 


14 


gb_om : 


* 


15 


gb_ba: 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No 



% 

Query 



> . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


966 


97 .6 


2068 


6 


BC085707 


BC085707 Rattus no 


2 


966 


97.6 


338115 


12 


AC111953 


AC111953 Rattus no 


3 


924.4 


93 ,4 


1512 


6 


AY061636 


AY061636 Rattus no 


4 


907.4 


91 .7 


909 


5 


AF458 58 7 . 


AF4 58 58 7 Homo sapi 


5 


885. 4 


89.4 


921 


6 


AF458586 


AF458586 Rattus no 


6 


572.6 


57 .8 


2046 


6 


BC108388 


BC108388 Mus muscu 
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c 


7 


572 . 6 


57 


.8 


188325 


6 


AL662807 


AL662 807 Mouse DNA 


c 


8 


572 . 6 


57 


. 8 


204891 


12 


AL83 7519 


AL837519 Mus muscu 




9 


571 


57 


. 7 


133 2 


2 


E1440 0 


E 14 40 0 cDNA encodi 




1 0 


571 


57 


. 7 


1333 


6 


D6366 3 


D63 66 3 Mus musculu 




1 1 


511 . 8 


51 


. 7 


870 


5 


AF451 983 


AF4 51983 Homo Scipi 




12 


503 . 6 


50 


. 9 


1856 


6 


AF 18010 9 


Ar±ouxu7 PIUS muscu 




13 


347.8 


35 


. 1 


78 3 


2 


E1700 7 


E 17 00 7 DNA en cod in 




14 


276 . 4 


27 


. 9 


495 


5 


AF451 984 


AF4 51984 Homo sapi 


c 


15 


237 . 2 


24 


. 0 


110000 


12 


CT03 06 94_3 


Continuation (4 of 


c 


16 


237.2 


24 


. 0 


^ ^ ^ ^ ^ 

223 771 


6 


AL772344 


Aij7 72 34 4 MOUSe UNA 




17 


226.8 


22 


. 9 


880 


2 


AR481 89 1 


AR4oio9i Sequence 




18 


226.8 


22 


. 9 


88 0 


2 


AXlll 704 


AX111704 Sequence 




19 


226 . 8 


22 


. 9 


143429 


5 


HS309H15 


AIj033 53 9 Human DNA 




20 


222 


22 


. 4 


2 03 4 


5 


AK056824 


AK056 82 4 Homo Sapi 




2 1 


211.6 


21 


. 4 


2 14 7 


14 


BCIO 95 29 


BC109529 Bos tauru 




22 


211.6 


21 


. 4 


18823 0 


12 


AC15 93 82 


AC159382 Bos tauru 




2 3 


208.4 


21 


- 1 


1 94 2 


14 


BTA2 37 666 


aj2375d5 Bos tauru 




24 


179.4 


18 


- 1 


1 n r\ ^ t\ 

180286 


6 


AL645 600 


ALo45 60U Mouse DNA 




25 


177 . 8 


18 


. 0 


124244 


6 


I^T AAAA^^ 

AL929 026 


AIi929 02 6 Mouse DNA 


c 


26 


175-6 


17 


. 7 


186202 


6 


AC125 319 


ACl25 3iy Mus muscu 


c 


27 


173.2 


17 


. 5 


194303 


6 


AC154296 


AC154 296 Mus muscu 




28 


170 . 8 


17 


. 3 


152842 


6 


AC156500 


AC156 50 0 Mus muscu 




29 


170 


17 


. 2 


213866 


6 


AC114589 


AC114 589 Mus muscu 


c 


30 


17 0 


17 


. 2 


234919 


6 


AC121887 


AC121887 Mus muscu 


c 


3 1 


166 . 6 


16 


. 8 


157754 


6 


AC132313 


AC132 313 Mus muscu 




32 


166 . 6 


16 


. 8 


228033 


6 


AC168050 


AC168 05 0 Mus muscu 


c 


33 


166 . 4 


16 


. 8 


175398 


6 


AC102 151 


AC102151 Mus muscu 


c 


34 


165 . 8 


16 


. 7 


144094 


6 


AC109193 


AC109193 Mus muscu 


c 


35 


165 . 4 


16 


. 7 


2 00574 


6 


BX005304 


« V A A r A il %ir A ..MA XT IV 

BX005304 Mouse DNA 


c 


36 


165 . 2 


16 


. 7 


186404 


6 


AC16963 0 


AC169 63 0 Mus muscu 




37 


165 . 2 


16 


. 7 


205054 


6 


AC098 712 


AC0 98 712 Mus muscu 




38 


165 


16 


. 7 


116887 


6 


AL929 153 


IVT ftOOIC^ XJIav « M M T>XT1V 

AIj929153 Mouse DNA 


c 


3 9 


165 


16 


. 7 


181676 


6 


AC 124194 


AC124194 Mus muscu 


c 


40 


165 


16 


. 7 


200180 


12 


AC169629 


AC 16 9629 Mus muscu 


c 


41 


164.6 


16 


.6 


174139 


12 


AC116700 


AC116700 Mus muscu 




42 


163.6 


16 


.5 


171554 


6 


AC 142 22 8 


AC142228 Mus muscu 




43 


163.6 


16 


.5 


183408 


6 


AC119873 


AC119873MUS muscu 




44 


163 


16 


,5 


163916 


6 


AC141887 


AC14188 7 Mus muscu 


c 


45 


162. 8 


16 


.4 


191753 


6 


AC125 23 5 


AC12523 5 Mus muscu 



ALIGNMENTS 



RESULT 1 
BC085707 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BC085707 2068 bp raRNA linear ROD 20-OCT-2005 

Rattus norvegicus hepatoma -derived growth factor, related protein 
1, mRNA (cDNA clone MGC:93218 IMAGE : 7132556) , complete cds . 
BC085707 

BC085707. 1 GI: 55 715911 

MGC. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontogl ires ; Glires; Rodentia; 
Sciurognathi ; Muroidea; Muridae; Murinae; Rattus. 
1 (bases 1 to 2068) 

Strausberg, R. L. , Feingold, E. A. , Grouse, L .H . , Derge , J.G . , 

Klausner, R. D. , Collins, F.S . , Wagner, L., Shenmen, CM. , Schuler,G .D . , 

Altschul, S. F. , Zeeberg,B., Buetow,K.H., Schaef er , C . F . , Bhat,N.K., 
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Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko, L. , Marusina,K., Fairmer , A . A . , Rubin, G.M., Hong,L., 
Stapleton,M . , Soares, M, B. , Bonaldo,M.F . , Casavant, T. L. , 
Scheet2,T.E. , Brownstein,M . J . , Usdin,T.B. , Toshiyuki,S . , 
Carninci,P., Prange,C., Raha,S.S., Loquellano, N. A. , Peters, G. J., 
Abramson, R. D. , Mullahy, S, J . , Bosak, S .A . , McEwan, P. J. , 
McKernan, K. J. , Malek,J.A., Gunaratne , P .H . , Richards, S., 
Worley,K.C., Hale,S., Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon, D.K. , Muzny,D.M., Sodergren, E . J . , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman,M., Madan,A., Rodrigues, S. , 
Sanchez, A., Whiting, M., Madan,A., Young, A. C, Shevchenko , Y . , 
Bouf f ard, G.G. , Blakesley,R . W . r Touchman, J. W. , Green, E. D. , 
Dickson,M.C . , Rodriguez , A. C. , Grimwood, J . , Schmutz,J., Myers, R.M., 
Butterf ield, Y .S . , Krzywinski ,M . I . , Skalska,U. , Smailus , D .E . , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
CONSRTM Mammalian Gene Collection Program Team 

TITLE Generation and initial analysis of more than 15,000 full-length 

human and mouse cDNA sequences . _ . 

JOURNAL Proc. Natl. Acad. Sci . U.S.A. 99 (26), 16899-16903 (2002); 
PUBMED 124 77 93 2 
REFERENCE 2 (bases 1 to 2068) 
AUTHORS 

CONSRTM NIH MGC Project 
TITLE Direct Submission 

JOURNAL Submitted ( 01 -NOV-2004) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Bethesda, MD 20892-2590, USA 

REMARK NIH -MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact : MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Howard Jacobs 

cDNA Library Preparation: Express Genomics 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc.stanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 

Dickson, M. , Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, 
R. M. 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/ LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 184 Row: p Column: 12 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 19424221. 
FEATURES Location/Qualifiers 
source 1. .2068 

/organisms "Rat tus norvegicus" 

/mol_type= "mRNA" 

/db_xref = " taxon : 1 01 16 " 

/clone="MGC: 93218 IMAGE : 713 2556 " 

/tissue_type="Testis, rat (Brown Norway)" 

/clone_lib="NIH_MGC_2 38" 

/lab_host="DH10B" 

/note= "Vector : pExpressl" 
gene 1. .2068 

/gene= "Pwwpl " 

/not e= "synonym : Hdgfrpl" 

/db_xref ="GeneID: 1710 74 " 

/db_xr ef = " RGD : 6 2 1 0 1 2 " 
CDS 31. .945 

/gene= "Pwwpl" 
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/ c odon_s t a r t = 1 

/products "hepatoma -derived growth factor, related protein 
1" 

/protein_id="AAH85707 .1" 
/db_xref ="01:55715912 " 
/db_xref ="GeneID: 171074" 
/db_xref =" RGD: 621012" 

/translations "MSCFSRPKYKTGDLVFAKLKGYAHWPARIEHVTEPNRYQVFFFG 
THETALLGPKHLFPYEESKERFGKPNKRRGFSEGLWEIEHDPMVEASPCLCPDEEQLC 
AEEPGPGEEPEP6QELEPESRPELESMPELEAEPRPEKECEQEPEQEPEQEPEQELEQ 
EPELEPEPEPEPEPEPEPEPEPEPQPAYDLLDAKEEPGLIEAEPGDQQAEQVREQHAE 
AEVMAWEEPESLKRSAEDEQPHSPPKRPREAAPGALEMEPAGEREAEACPF^EPDQ 
AQEQQTPLEEEATEEAVQGLMVGEIEGL" 



ORIGIN 



Query Match 97.6%; Score 966; DB 6; Length 2068; 

Best Local Similarity 98.9%; Pred. No. 1.7e-223; 

Matches 985; Conservative 0; Mismatches 5; Indels 6; Gaps 1; 
Qy 1 CGCTATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 60 

III lllllllllilllllMIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIMIIIII 

Db 27 CGCCATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 86 

Qy 61 AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 12 0 

IIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

Db 87 AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 14 6 

Qy 121 GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIII 
Db 147 GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 206 

Qy 181 TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 24 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII llllllll llllll 

Db 2 07 TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 26 6 

Qy 241 GTGGGAGATCGAGCACGACCCTATGGCTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 300 

I II II II II II II II MM II Mill Mill II II II llllllll llllll MM MM . 

Db 267 GTGGGAGATCGAGCACGACCCTATGGTTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 326 

Qy 3 01 GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 36 0 

III MM II II II II II MM III MM MM II MM MM MM II MM MM MM 

Db 3 27 GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 386 

Qy 3 61 GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 42 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 3 87 GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 44 6 

Qy 4 21 GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGGAGCCGGA 48 0 

IIIIIIIIIIIMIIMIIMIIIIIIIIIIIIIIIIIIIIIM lllllllllll III 

Db 447 GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGA 506 
Qy 4 81 GCTGGAGCC GGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGA 53 4 

II llllll IIIIIIIMIIMIIIIIIIIIIIIIIMIIIIIIIIIMIIIII 

Db 5 07 GCAGGAGCCGGAGCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGA 566 

Qy 535 GCCGGAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCT 594 

IIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 
Db 567 GCCGGAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCC7UVGGAGGAGCCTGGCCT 62 6 

Qy 595 CATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGC 654 

III MM MM II MM MM III MM MM II llllllll II II II MM MM MM 
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Db 627 CATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGC 686 

Qy 6 55 TGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACA 714 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 6 87 TGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACA 74 6 

Qy 715 GCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCC 774 

. Illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 747 GCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCC 806 

Qy 7 75 GGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGA 834 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 
Db 8 07 GGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGA 866 

Qy 835 ACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGG 894 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db . 8 67 ACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGG 92 6 

Qy 8 95 AGAAATCGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGT 954 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 9 27 AGAAATCGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGT 986 

Qy 955 GCCACCTGGCTGTGGCTTGGGAAACCCGCTAGGGCC 990 

lllllllllllllllllllllllllillllllllll 
Db 987 GCCACCTGGCTGTGGCTTGGGAAACCCGCTAGGGCC 1022 



RESULT 2 
AC111953 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



AC111953 338115 bp DNA linear HTG 09-NOV-2002 

Rattus norvegicus clone CH230-211F3, *** SEQUENCING IN PROGRESS 
***, 6 unordered pieces. 
AC111953 

AC11195 3.4 GI : 24 818630 

HTG; HTGS_PHASE1; HTGS_DRAFT; HTGS^ENRICHED . 
Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Glires; Rodent ia; 
Sciurognathi ; Muroidea; Muridae; Murinae; Rattus. 
1 (bases 1 to 338115) 

Muzny, D .Marie . , Met zker ,M. Lee . , Abramzon,S., Adams, C, Alder, J. , 
Allen, C, Allen, H., Alsbrooks , S . , Amin,A., Anguiano,D., 
Anyalebechi , V. , Aoyagi,A., Ayodeji,M., Baca,E., Baden, H., 
Baldwin, D., Bandaranaike, D . , Barber, M., Barnstead, M. , Benahmed, F. , 
Biswalo,K., Blair, J., Blankenburg, K. , Blyth,P., Brown, M., 
Bryant, N., Buhay,C., Burch,P., Burrell,K., Calderon,E., 
Cardenas, v., Carter, K., Cavazos,!., Ceasar,H., Center, A., 
Chacko,J., Chavez, D., Chen,G,, Chen,R., Chen,Y., Chen,Z., Chu,J., 
Cleveland,C . , Cockrell,R., Cox,C., Coyle,M., Cree,A., D'Souza,L., 
Davila,M.L., Davis, C, Davy-Carroll , L . , De Anda,C., Dederich,D., 
Delgado,0., Denson,S., Deramo,C., Ding,Y., Dinh,H., Divya, K. , 
Draper, H., Dugan-Rocha , S . , Dunn, A., Durbin,K., Duval, B., Eaves, K,, 
Egan,A., Escotto,M., Eugene, C, Evans, C. A., Falls, T., Fan,G., 
Fernandez , S . , Finley,M., Flagg,N., Forbes, L., Foster, Ml, Foster,?., 
Fraser,C.M., Gabisi,A., Ganta,R., Garcia, A., Garner, T., Garza, M., 
Gebregeorgis, E. , Geer,K., Gill,R., Grady, M., Guerra,W., Guevara, W., 
Gunaratne,P. , Haaland,W., Hamil,C., Hamilton, C, Hamilton, K., 
Harvey, Y., Havlak,P., Hawes,A., Henderson, N. , Hernandez, J. , 
Hernandez ,R. , Hines,S., Hladun,S.L., Hodgson, A., Hogues,M., 
Hollins,B., Howells,S., Hulyk/s. , Hume, J., Idlebird,D., Jackson, A., 
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TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



Jackson, L., Jacob, L., Jiang, H., Johnson, B., Johnson, R,, Jolivet,A., 
Karpathy,S., Kelly, S., Kelly,S., Khan,Z., King,L., Kovar,C., 
Kowis,C., Kraft, C.L., Lebow,H., Levan,J., Lewis, L., Li , Z . , Liu, J., 
Liu, J., Liu,W., Liu,Y., London,?., Longacre,S., Lopez, J., 
Lorensuhewa,L. , Loulseged, H. , Lozado,R.J., Lu,X., Ma, J., 
Maheshwari , M. , Mahindartne , M . , Mahmoud,M., Malloy,K., Mangum,A. , 
Mangum,B., Mapua,P., Martin, K., Martin, R., Martinez, E., 
Mawhiney,S., McLeod,M.P., McNeill , T. Z. , Meenen,E., 
Milosavl j evic , A . , Miner, G. , Minja,E., Mont emay or , J . , Moore, S., 
Morgan, M., Morris, K., Morris, S., Munidasa,M., Murphy, M., Nair,L., 
Nankervis , C . , Neal,D., Newton, N., Nguyen, N., Norris,S., 
Nwaokelemeh , O . , Okwuonu,G., Olarnpunsagoon , A . , Pal , S . , Parks, K., 
Pasternak, 8 . , Paul,H., Perez, A., Perez, L., Pfannkoch, C . , 
Plopper,F., Poindexter, A. , Popovic,D., Primus, E., Pu,L.-L., 
Puazo,M., Quiroz,J., Rachlin,E., Reeves, K., Regier,M.A., Reigh,R., 
Reilly,B., Reilly,M., Ren,Y., Reuter,M., Richards, S., Riggs,F., 
Rives, C, Rodkey,T., Rojas,A., Rose,M., Rose,R., Ruiz, S. J., 
Sanders, W., Savery,G., Scherer,S., Scott, G., Shatstnan, S. , Shen,H., 
Shetty,J., Shvartsbeyn, A. , Sisson,!., Sitter, CD., Smajs,D., 
Sneed,A., Sodergren , E . , Song,X.-Z., Sorelle,R., Sosa,J., 
Steimle,M., Strong, R., Sutton, A., Svatek,A., Tabor, P., Taylor, C, 
Taylor, T., Thomas, N., Thomas, S., Tingey,A., Trejos,Z., Usmani,K., 
Valas,R., Vera, v., Villasana , D . , Waldron,L., Walker, B., Wang, J. , 
Wang,Q., Wang,S., Warren, J., Warren, R., Wei,X., White, F., 
Williams, G,, Willson,R., Wlec2yk,R., Wooden, H., Worley,K., 
Wright, D., Wright, R., Wu,J., Yakub,S., Yen, J., Yoon,L., Yoon,V. , 
Yu,F., Zhang, J., Zhou, J., Zhou,X., Zhao,S., Dunn,D., von 
Niederhausern,A. , Weiss, R. , Smith, D.R., Holt, R. A., Smith, H.O., 
Weinstock,G. and Gibbs,R.A. 
Direct Submission 
Unpublished 

2 (bases 1 to 338115) 
Worley,K.C. 

Direct Submission 

Submitted ( 19 -FEB -2002) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 

3 (bases 1 to 338115) 

Rat Genome Sequencing Consortium. 
Direct Submission 

Submitted ( 09 -NOV -200 2) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 

On Nov 9, 2002 this sequence version replaced gi:23101342. 
The sequence in this assembly is a combination of SAC based reads 
and whole genome shotgun sequencing reads assembled using Atlas 
(http : //www ,hgsc. bcm. tmc.edu/projects/rat/) . Each contig described 
in the feature table below represents a scaffold in the Atlas 
assembly (a ' cont ig-scaf f old* ) - Within each contig-scaf f old, 
individual sequence contigs are ordered and oriented, and separated 
by sized gaps filled with Ns to the estimated size. The sequence 
may extend beyond the ends of the clone and there may be sequence 
contigs within a cont ig-scaf f old that consist entirely of whole 
genome shotgun sequence reads. Both end sequences and whole genome 
shotgun sequence only contigs will be indicated in the feature 
table. 

Genome Center 

Center: Baylor College of Medicine 
Center code : BCM 

Web site: http://www.hgsc.bcm.tmc.edu/ 
Contact: hgsc-help@bcm.tmc.edu 
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Project Information 

Center project name: GOSJ 

Center clone name: CH2 30-211F3 
Summary Statistics 

Assembly program: Phrap; version 0.990329 

Consensus quality: 233778 bases at least Q40 

Consensus quality: 236747 bases at least Q30 

Consensus quality: 238 950 bases at least Q20 

Estimated insert size: 235437; sum-of -contigs estimation 

Quality coverage: 6x in Q2 0 bases; sum-of -contigs estimation 



NOTE; Estimated insert size may differ from sequence length 

(see http : //www .hgsc.bcm.tmc .edu/docs/Genbank_draft_data .html) 
NOTE: This is a 'working draft' sequence. It currently 
consists of 6 contigs. The true order of the pieces 
is not known and their order in. this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is available and the accession number will 



* 


be preserved. 








it 


1 


12369 : 


contig 


of 12369 


bp in length 


•k 


12370 


12469 : 


gap of 


unknown 


length 


* 


12470 


34481 : 


contig 


of 22012 


bp in length 


* 


34482 


34581 : 


gap of 


unknown 


length 


* 


34582 


329483 : 


contig 


of 2 94 902 bp in length 


* 


329484 


329583: 


gap of 


unknown 


length 


* 


329584 


330973 : 


contig of 1390 


bp in length 


* 


330974 


331073 : 


gap of 


unknown 


length 




331074 


334003 : 


contig 


of 2930 


bp in length 


* 


334004 


334103 : 


gap of 


unknown 


length 




334104 


338115 : 


contig of 4012 


bp in length. 



FEATURES 

source 



gap 



misc feature 



gap 



misc feature 



misc feature 



gap 



gap 



gap 



Location/Qualif iers 
1. .338115 

/organism= "Rat tus norvegicus" 

/mol_type= "genomic DNA" 

/db_xr ef = " taxon : 1 01 16 " 

/clone="CH23 0-211F3 " 

12370. .12469 

/est imat ed_l ength ^unknown 

12470. .13894 

/note= "wgs_contig " 

34482. .34581 

/est imat ed_l ength =unknown 

56734. .58848 

/note= "wgs_contig " 

59170. .60297 

/not e= " wgs_contig " 

329484. .329583 

/e St imat ed_l ength =unknown 

330974. .331073 

/e St imat ed_l ength =unknown 

334004. .334103 

/est imat ed_l ength =unknown 



ORIGIN 



Query Match 97.6%; 
Best Local Similarity 98.9%; 
Matches 985; Conservative 



Score 966; DB 12; Length 338115; 
Pred. No. 1.9e-223; 
0; Mismatches 5; Indels 6; Gaps 



1; 



Qy 



1 CGCTATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 60 
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III IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


116984 


CGCCATGTCTTGCTTCAGCCGCCC7VAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 


117043 


Ov 
wy 


61 


A AA GC TATG CC r A TT CC AGC OA AT TG AA CA.TG TC AC 


120 






IIIIIIMIMIIMIMMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIilllll 




Db 


117044 


AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 


117103 


Ov 


121 

X 4b X 


GGTGT TC TT CT TO GG GA CO CATGA G A CC GC CC TG CTGG GC CC C A AG CA CC TT TT TC CT T A 

WW X V7 X XwXXwX X ^ V7VJ w.rt ^ wA X \Jr\\^\^\J\^ ^w X Vj V- X V3w w^ V> wri.n.V7 wx^VvVr X X X X XVvVvX 


Ifi 0 

XO V 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


117104 


GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 


117163 


Ov 

vy 


J. O X 


X Vji^vJVJ/^Vj X \^t\.±\\j \jr\ X X v_ yj\j \^t\r\\j \^ X t\r\\^ V_.vj X X V^rWj X Vj/\ \J\J \^ X 








IIIIIIMIIMI lllllllilllllll MM llllllllllllllllllllllllllll 




Db 


117164 


TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 


117223 


Ov 

wy 


241 


O TO GG AG AT CG AG CA CO AC CC TAT OO CT OA OO CC TC CC CT TO CC TG TO CC C A OA TO AOOA 


300 






IIIMMMIIIIIIMMIIIIIII IMIIMMMIIMMIIMMIIMMIIM 




Db 


117224 


GTGGGAGATCGAGCACGACCCTATGGTTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 


117283 


Ov 

wy 


J U X 


OCAOCTTTGTOCCOAOOAOCCAOOOCCAGGAOAGGAGCCAOAOCCOOOOCAOOAOCTOOA 








IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIMMIIMMIIII 




Db 


117284 


GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 


117343 


Ov 

wy 


^ D X 


OCCOGAATCCAGOCCTOAOCTOOA ATCCATOCCTOAOCTOOAOOCAOAACCOAOOCCTOA 


42 0 






1 M M II II M M II M II M Ml II II M M II M M IIMII M II II II M M II II 




Db 


117344 


GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 


117403 


Ov 

wy 


421 


GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGOAGCCGGA 


48 0 






IIIIIIIMIIMIIIIIIIIIMilllllllllllMIIIIII IIIIIIIIIM III 




Db 


1174 04* 


GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGA 


117463 


Ov 

wy 


*x o X 


(^CTC^A(%CC OOAOCCOOAOCCOOAOCCOOAOCCOOAOCCOOAOCCCOAOCCCOA 


3 J t 






II MUM IIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 




Db 


117464 


GCAGGAGCCGGAGCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGA 


117523 


Ov 

wy 




OCPOG AOOOOO AOOOPr' AOPPTOP PT ATOA PPTA PTOO ZVTOPPA AOnA/ZO AflPPTf^OPPT 


c^Q 4 






IMIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


117524 


GCCGGAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCT 


117583 


Ov 
wy 


595 


CATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGC 


654 






IIIIIIIMIIII IIMIIIIIII IIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIM 




Db 


117584 


CATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGC 


117643 


Ov 

wy 


655 


T OA OO TC AT OO CT OT AO TOO A OO A OC COO A OA OT CT OA AO AGO A GC GC GG AG G A TG AA C A 


714 






lllll II MM MM MINIMI II IIMII MINI IIIIMIIMIIIIIIIIMM 




Db 


117644 


TGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACA 


117703 


Ov • 

wy 


1 xs 


rjpprripTVf-i AriTPPTPPPA A APGGPPPAGGGAGGPGGPGPPTGGPGPGPTGG AG ATGGAGPP 


7*7 4 






IMIIIIIIIIIIIIMIIIIIMIIIIMMIIIIIIMIIIIIIMIIIIMIIMII 




Db 


117704 


GCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCC 


117763 


Ov 
wy 


7 75 


CiC^CTCzciiiCiTincrzmii.dCiCAnAacic ptgppppttpgtgg aggagpptgapp aagppp agga 

VJ nJV_ X wwnwAwV^ w^ wAwwVa>/AwAwwV» X vjv— v» X X O X vJ vJ xAVJ Vj/a \J X VjA^v^n/Aw^ \^ vj \J.fTk 


P"^ 4 
O J t 






IMMMMIIMMIIMMMIMMMMIIMMMMMMMMMMMMM 




Db 


117764 


GGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGA 


117823 


Qy 


835 


ACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGG 


894 






1 II II II II II II II 1 1 II II II 1 II II 1 111 II II M II 1 11 1 II II II II II II II II 




Db 


117824 


ACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGG 


117883 


Qy 


895 


AGAAATCGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGT 


954 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
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Db 

Qy 

Db 



117884 AGAAATCGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGT 117943 



955 



990 



GCCACCTGGCTGTGGCTTGGGAAACCCGCTAGGGCC 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 
117944 GCCACCTGGCTGTGGCTTGGGAAACCCGCTAGGGCC 117979 



^RESULT 3 
AY06163€^ 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



AY061636 1512 bp mRNA linear ROD 13-NOV-2001 

Rattus norvegicus hepatoma -derived apoptotic -related protein mRNA, 
complete cds. 
AY061636 

AY061636.1 GI:16923254 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Glires; Rodentia; 
Sciurognathi ; Muroidea; Muridae; Murinae; Rattus. 

1 (bases 1 to 1512) 

Tirado,O.M., Selva,D.M., Munell,F., Suarez -Quian, C . and Reventos,J. 

A new member of the HDGF family, an apoptotic role? 

Unpublished 

2 (bases 1 to 1512) 

Tirado,O.M., Selva,D.M., Munell,F. and Reventos,J. 
Direct Submission - ^ 

Submitted (01-NOV-200^aa^ URB, Hospitals Vail d'Hebron, Pso Vail 
d'Hebron 119-129, Barcelona, Spain 

Loca t i on /Qua 1 i f ie rs 

1. .1512 

/organisms "Rattus norvegicus" 
/mol_type= "mRNA" 
/strain^ "Sprague-Dawley" 
/db_xr ef = " taxon : 1 01 16 " 
/dev_stage=" 10 weeks old" 
13. .903 

/note="HARP; growth factor" 
/codon_start=l 

/product ="hepatoma-derived apoptotic -related protein" 
/protein_id= "AAL2 9938 .1" 
/db_xref="GI : 16923255" 

/translation="MSCFSRPKYKTGDLVFAKLKGYAHWPARIEHVTEPNRYQVFFFG 
THETALLGPKHLFPYEESKERFGKPNKRRGFSEGLWEIEHDPMVEASPCLCPDEEQLC 
AEEPGPGEEPEPGQELEPESRPELESMPELEAEPRPEKECEQEPEQEPEQEPEQELEQ 
EPELEPEPEPEPEPEPEPEPSLPMTYWMPGLIEAEPGDQQAEQVREQHAEAEVMAVVE 
EPESLKRSAEDEQPHSPPKRPREAAPGALEMEPAGEREAEACPFVEEPDQAQEQQTPL 
EEEATEEAVQGLMVGEI EGL" 



ORIGIN 



Query Match 93.4%; Score 924.4; DB 6 

Best Local Similarity 97.6%; Pred . No. 2.2e-213 
Matches 966; Conservative 0; Mismatches 6 



Length 1512; 

Indels 18 ; Gaps 2 ; 



Qy 



Db 



1 CGCTATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 

I II II II II II II II II II II III II MM II II II II II II II II II II II 1 1 II II II 

9 CGCTATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 



60 



68 



Qy 
Db 



61 AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 12 0 

IIIIIIIIIIIIIIIIIIIMIM IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

69 AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 12 8 
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Qy 121 GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII . 

Db 129 GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 188 

Qy 181 TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 24 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

Db 189 TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 24 8 

Qy 241 GTGGGAGATCGAGCACGACCCTATGGCTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 300 

llllllllllllllllllllllllll IIIIIMIIIIIMIIIIIIIIIIIIIIIIIII 
Db 249 GTGGGAGATCGAGCACGACCCTATGGTTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 308 

Qy 3 01 GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 36 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 3 09 GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 36 8 

Qy 3 61 GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 42 0 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 3 69 GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 42 8 

Qy . 421 GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGGAGCCGGA 480 

llllllllllllllllllllllllllllllllllllllllllll IIIIIIIIIM III 
Db 429 GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGA 48 8 

Qy 4 81 GCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGAGCCGGA 54 0 

M IMIIIIMII IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII llllllll 

Db 4 89 GCAGGAGCCGGAGCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCGGAGCCGGA 54 8 

Qy 541 GCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCTCATTGA 600 

I lllllllillllllllllllllllll Illlllllillllll 
Db 549 G CCCAGCCTGCCTATGACCTACTGGAT GCCTGGCCTCATTGA 5 90 

Qy 6 01 GGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGCTGAGGT 66 0 

I llllllll MM II Mil II III II II llllllll MM llllllll II II II llllll 

Db 5 91 GGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGCTGAGGT 65 0 

Qy 661 CATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACAGCCTCA 72 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 651 CATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACAGCCTCA 710 

Qy 721 CAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCCGGCTGG 78 0 

IIIMIIIIIIIIIIIIIIIIIII IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

Db 711 CAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCCGGCTGG 77 0 

Qy 781 AGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGAACAGCA 84 0 

IMIIIIMII llllll MM III llllllllllllll II MM IIIIIIIIMII II II 

Db 771 AGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGAACAGCA 83 0 

Qy 841 GACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGGAGAAAT 900 

IIIIIIIIIIIIIIIIIMII III llllllllllllll II MM llllllllllllll II 

Db 831 GACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGGAGAAAT 89 0 

Qy 9 01 CGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGTGCCACC 96 0 

I II llllll II II II II IIIIIIIIIM llllll II MM II II llllll llllll MM 

Db 891 CGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGTGCCACC 95 0 

Qy 961 TGGCTGTGGCTTGGGAAACCCGCTAGGGCC 990 

llllllllllllllllllllllllllllll 
Db 951 TGGCTGTGGCTTGGGAAACCCGCTAGGGCC 980 
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RESULT 4 
AF458587 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



AF458587 909 bp mRNA linear PRI lO-JUN-2002^ 

Homo sapiens hepatoma -derived growth factor HGDF5 mRNA, complete 
cds . 

AF458587 

AF458587. 1 GI: 21360801 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 909) 

Zhang, P. Z., Dong,Y.M., Shan,Y.X., Wei,Y.H., Guo,J.H. andYu,L. 
Direct Submission 

Submitted (15-DEC-2001)' School of Life Sciences, Institute of 
Genetics, Fudan University, Handan Road, 220, Shanghai 200433, 
China 

Location/Qualifiers 
1. .909 

/organisms "Homo sapiens" 
/mol_type= "mRNA" 
/db_xref =" taxon; 9 606" 
/tissue_type=" liver" 
1. .909 

/codon_start=l 

/products "hepatoma -derived growth factor HGDF5" 
/protein_id= "AAM4 9715 .1" 
/db_xref ="GI :21360802" 

/translations" MSCFSRPKYKTGDLVFAKLKGYAHWPARIEHVTEPNRYQVFFFG 
THETALLGPKHLFPYEESKERFGKPNKRRGFSEGLWEIEHDPMVEASPCLCPDEEQLC 
AEEPGPGEEPEPGQELEPESRPELESMPELEAEPRPEKECEQEPEQEPEQELEQEPEL 
EPEPEPEPEPEPEPEPEPEPEPQPAYDLLDAKEEPGLIEAEPGDQQAEQVREQHAEAE 
VMAVVEEPESLKRSAEDEQPHSPPKRPREAAPGALEMEPAGEREAEACPFVEEPDQAQ 
EQQT PL EE BATE EA VQG LM VGE I EGL " 



ORIGIN 



Query Match 91.7%; 
Best Local Similarity 99.9%; 
Matches 90 8; Conservative 



Score 90 7.4; DB 5; 
Pred. No. 2.8e-209; 
0; Mismatches 1; 



Length 909; 
Indels 0; 



Gaps 



0; 



Qy 



Db 



5 ATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATTAAAG 64 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
1 ATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATTAAAG 60 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



65 GGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCAGGTG 124 

IIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
61 GGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCAGGTG 12 0 

125 TTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTATGAG 184 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIMIIIII 

121 TTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTATGAG 18 0 



185 



244 



GAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCTGTGG 

IIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 
181 GAGTCCAAGGAGAGGTTCGGCAAGCCTAACAA6AGGCGCGGCTTCAGTGAGGGGCTGTGG 24 0 



245 GAGATCGAGCACGACCCTATGGCTGAiGGCCTCCCCTTGCCTGTGCCCAGATGAGGAGCAG 
llllllllllllllllllllll lllllllllllllllllllllllllllllllllllll 



304 
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Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 
Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



241 GAGATCGAGCACGACCCTATGGTTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGAGCAG 300 



305 



845 



841 



364 



CTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGAGCCG 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIM 
3 01 CTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGAGCCG 360 



3 65 GAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGAGAAA 

I II II II 11.11 II II II II II III II II II II II I Ml II II II MM 1 1 II II Mil II 

3 61 GAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGAGAAA 
425 GAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGGAGCCGGAGCTG 

I 1 1 1 1 1 1 1 II M I II M 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 II 1 1 1 1 II II 1 1 1 1 II 1 1 1 II I M 1 1 

4 21 GAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGGAGCCGGAGCTG 
4 85 GAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGAGCCGGAGCCG 

IIMIIIMIIIIMIIIIIIIilllllllllllllllMIIIIIIMIIIIMIIIIII 

4 81 GAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGAGCCGGAGCCG 
545 GAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCTCATTGAGGCC 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIII 

541 GAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCTCATTGAGGCC 
605 GAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGCTGAGGTCATG 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIM 

6 01 GAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGCTGAGGTCATG 
6 65 GCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACAGCCTCACAGT 

•I II 1 1 II II II II II 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 II M 1 1 II M 1 1 1 1 

6 61 GCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACAGCCTCACAGT 



424 



420 



484 



480 



544 



54 0 



604 



600 



664 



660 



724 



720 



784 



7 25 CCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCCGGCTGGAGAG 

I II II II M II 1 1 II 1 1 II II II M 1 1 II 1 1 1 1 1 II 1 1 M II M II M M M II II M 1 1 

721 CCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCCGGCTGGAGAG 780 



785 



844 



CGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGAACAGCAGACT 

IIIIIIIMIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 
7 81 CGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGAACAGCAGACT 84 0 



CCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGGAGAAATCGAA 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

CCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGGAGAAATCGAA 



904 



900 



9 05 GGCCTGTAG 913 

IIIIIMII 
901 GGCCTGTAG 9 09 



RESULT 5 
AF4 58 586 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 



AF458586 921 bp mRNA linear ROD 24-JUL-2003 

Rattus norvegicus hepatoma -derived growth factor variant, mRNA, 
complete cds. 
AF458586 

AF45858 6. 1 GI: 331877 02 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chorda ta; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Rattus. 
1 (bases 1 to 921) 
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AUTHORS Zhang, P. Z., Dong,Y.M., Shan,Y.X., Wei,Y.H., Guo,J.H. andYu,L. 

TITLE Direct Submission 

JOURNAL Submitted ( 11 -DEC -200 1) School of Life Sciences, Intitute of 

Genetics, Fudan University, Handan Road, 220, Shanghai 200433, 
China 

FEATURES Location/Quali f iers 

source 1. .921 

/organism= "Rattus norvegicus" 

/mol_type= "mRNA" 

/strain= "SD" 

/db_xr ef = " taxon : 1 01 16 " 

/t is sue_type=" testis" 
CDS 1. .921 

/note="HDGF; hepatoma -derived apoptotic -related protein 

variant" 

/codon_start=l 

/product =" hepatoma -derived growth factor variant" 
/protein_id= "AAP9 77 06 . 1 " 
/db_xref ="GI :3 318 7703" 

/translation="MSCFSRPKYKTGDLVFAKLKGYAHWPARIEHVTEPNRYQVFFFG 
THETALLGPKHLFPYEESKERFGKPNKRRGFSEGLWEIEHDPMVEASPCLCPDEEQLC 
AEEPGPGEEPEPGQELEPESRPELESMPELEAEPRPEKECEQEPEQEPEQEPEQELEQ 
EPELEPEPEPEPEPEPEPEPEPEPEPQPAYDLLDAKEEPGLIEAEPGDQQAEQVREQH 
AEAEVMAVVEEPESLKRSAEDEQPHSPPKRPREAAPGALEMEPAGEREAEACPFVEEP 
DQ AQ EQQT PL EE EATEE AVQGLM VGE I EGL " 

ORIGIN 

Query Match 8 9.4%; Score 885.4; DB 6; Length 921; 

Best Local Similarity 98.6%; Pred. No. 6.2e-204; 

Matches 908; Conservative 0; Mismatches 1; Indels 12; Gaps 1; 

Qy 5 ATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATTAAAG 64 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 ATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATTAAAG 60 

Qy 65 GGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCAGGTG 124 

IIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db 61 GGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCAGGTG 12 0 

Qy 125 TTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTATGAG 184 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMI 
Db 121 TTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTATGAG 18 0 

Qy 185 GAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCTGTGG 24 4 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIMIIIIIIII 
Db 181 GAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCTGTGG 24 0 

Qy 245 GAGATCGAGCACGACCCTATGGCTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGAGCAG 304 

IMIIIIIIIIIIIIIIIIIil IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 
Db 241 GAGATCGAGCACGACCCTATGGTTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGAGCAG 300 

Qy 3 05 CTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGAGCCG 364 

IIIIIIIIIIIIIIIIMIIIIIIIIMIIMIIIMIIIIIIIIIIIIIIIIIIIIIM 

Db 3 01 CTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGAGCCG 36 0 

Qy 3 65 GAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGAGAAA 424 

IIIIIIIIIIIIMIIIIIIIIIIMIIMIIIIIIIIIIIIIMIIIIIIIIIMIIM 

Db 361 GAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGAGAAA 42 0 

Qy 4 25 GAGTGT GAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAG 4 72 

mill IIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 
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Db 


421 


GAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAG 


480 


Ov 


473 


G AG CC GG AG CT GG AG CCGG AG CCG GAGC CGGAGC CG GAGC CGGAGCCGGAGC CCGAGC CC 


532 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 




Db 


481 


GAGCCGGAGCTGGAGCCGGAGCCG GAGC CGGAGC CGGAGCCGGAGC CGGAGCCCGAGCCC 


540 


Ov 

wy 


533 


rjARPrnnAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGC 


592 






IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIilMIIIIIIIIIIIIM 




Db 


541 


GAGCCGGAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGC 


600 


Ov 


593 


C TP ATTG AGGC CG AG CC AGGAGAT CAGC AAGC CGAG CAAGTG CG AG AGCAGC ACGCTG AA 


652 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIII 




Db 


601 


CTCATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAA 


660 


Ov 
vy 


653 


GCTGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAA 


712 






llllllllllillllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


661 


GCTGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAA 


720 


Ov 
vy 


7 13 


CAGCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAG 


772 






IIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


721 


CAGCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAG , 78 0 


Ov 
vy 


773 


CCGGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAG 


832 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


781 


CCGGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAG 


84 0 


Ov 
wy 


833 


GAACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTT 


892 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIII 




Db 


841 


GAACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTT 


900 


Qy 


893 


GGAGAAATCGAAGGCCTGTAG 913 








IIIIIIIIIIIMIIMIIII 




Db 


901 


GGAGAAATCGAAGGCCTGTAG 921 
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LOCUS (LOG) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 

SOURCE : 
ORGANISM (ORGN) : 



NUCLEIC ACID COUNT (NA) 
REFERENCE : 

AUTHOR (AU) : 



TITLE (TI) : 
JOURNAL (SO) : 



AF458587 GenBank (R) 

AF458587 

AF458587.1 GI: 21360801 

430805-71-3 

909 

mRNA; linear 
Primates 
10 Jun 2002 

Homo sapiens hepatoma -derived growth factor HGDF5 mRNA, 
complete cds. 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

: 209 a 245 c 331 g 124 t 

1 (bases 1 to 909) 

Zhang, P. Z.; Dong,Y.M.; Shan,Y.X.; Wei,Y.H.; Guo,J.H.; 
Yu,L. 

Direct Submission 

Submitted (15-DEC-2001) School of Life Sciences, 
Institute of Genetics, Fudan University, Handan Road, 
220, Shanghai 200433, China 



FEATURES (FEAT) : 
Feature Key 



Location 



Qualifier 



source 



CDS 



,909 



.909 



/organism="Homo sapiens" 
/ db - xr e f = " t axon : 9 6 0 6 " 
/tissue-type=" liver" 
/codon- start =1 

/produc t = " hepatoma - derived growth 

factor HGDF5" 

/protein- id= "AAM4 9715 .1" 

/db-xref="GI: 21360802" 

/translation="MSCFSRPKYKTGDLVFAKLK 

GYAHWPARIEHVTEPNRYQVFFFG 

THETALLGPKHLFPYEESKERFGKPNKRRGFSEG 

LWEIEHDPMVEASPCLCPDEEQLC 

AEEPGPGEEPEPGQELEPESRPELESMPELEAEP 

RPEKECEQEPEQEPEQELEQEPEL 

EPEPEPEPEPEPEPEPEPEPEPQPAYDLLDAKEE 

PGLIEAEPGDQQAEQVREQHAEAE 

VMAWEEPESLKRSAEDEQPHSPPKRPREAAPGA 

LEMEPAGEREAEACPFVEEPDQAQ 

EQQTPLEEEATEEAVQGLMVGEIEGL " 



SEQUENCE (SEQ) : 

1 atgtcttgct 
61 ggctatgccc 
121 ttcttcttcg 
181 gagtccaagg 
241 gagatcgagc 
301 ctttgtgccg 
361 gaatccaggc 
421 gagtgtgagc 
4 81 gagccggagc 
541 gagccccagc 
601 gagccaggag 
661 gctgtagtgg 
721 cctcccaaac 
781 cgcgaggcag 
841 ccgttggaag 



tcagccgccc 
attggccagc 
ggacccatga 
agaggttcgg 
acgaccctat 
aggagccagg 
ctgagctgga 
aggagccgga 
cggagccgga 
ctgcctatga 
atcagcaagc 
aggagccgga 
ggcccaggga 
aggcctgccc 
aagaggccac 



aaaatacaag 
gaggattgaa 
gaccgccctg 
caagcctaac 
99ttgaggcc 
gccaggagag 
atccatgcct 
gcaggagccg 
gccggagccg 
cctactggat 
cgagcaagtg 
gagtctgaag 
ggcggcgcct 
cttcgtggag 
agaggaggca 



accggggacc 
catgtcactg 
ctgggcccca 
aagaggcgcg 
tccccttgcc 
gagccagagc 
gagctggagg 
gagcaggagc 
gagccggagc 
gccaaggagg 
cgagagcagc 
aggagcgcgg 
ggcgcgctgg 
gagcctgacc 
gtccagggcc 



tggtgtttgc 
aacccaaccg 
agcacctttt 
gcttcagtga 
tgtgcccaga 
cggggcagga 
cagaaccgag 
tggagcagga 
ccgagcccga 
agcctggcct 
acgctgaagc 
aggatgaaca 
agatggagcc 
aagcccagga 
tgatggttgg 



caaattaaag 
ctaccaggtg 
tccttatgag 
ggggctgtgg 
tgaggagcag 
gctggagccg 
gcctgagaaa 
gccggagctg 
gccggagccg 
cattgaggcc 
tgaggtcatg 
gcctcacagt 
ggctggagag 
acagcagact 
agaaatcgaa 



901 ggcctgtag 
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<! --start Fragment > GenCore version 5.1.9 

Copyright (c) 1993 - 2006 Biocceleration Ltd. 



CM nucleic - nucleic search, using sw model 
Run on: November 22, 2006, 19:06:45 



Title : 

Perfect score j 
Sequence: 

Scoring table j 



Searched: 



; Search time 75 0 Seconds 
(without alignments) 
9203.368 Million cell updates/sec 



US-10-501-053 -1 
990 

1 cgctatgtcttgcttcagcc ttgggaaacccgctagggcc 990 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



5244920 seqs, 3486124231 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



10489840 



Database 



N_Geneseq_8 : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 



geneseqnl980s i* 
geneseqnl990s : * 
geneseqn2000s : * 
genes eqn2 00 las : * 
geneseqn2 00 lbs : * 
geneseqn2002as : * 
geneseqn2002bs: * 
geneseqn2 003as : * 
geneseqn2003bs : * 
geneseqn2003cs :* 
geneseqn2003ds : * 
geneseqn2004as : * 
geneseqn20 04bs : * 
geneseqn20 05s : * 
geneseqn2006s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



» . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


990 


100 


.0 


990 


9 


ADA37 366 


Ada3736 6 Human hep 


2 


885.4 


89 


.4 


921 


11 


ADQ96711 


Adq96711 Human Wol 


3 


511. 8 


51 


.7 


861 


11 


ADOS 98 24 


Ado5 9824 Human hep 


4 


505.2 


51 


.0 


882 


2 


AAV0173 0 


Aav0173 0 Mouse liv 


5 


347. 8 


35 


.1 


783 


2 


AAV57 58 2 


Aav57582 Nucleus -t 


6 


226. 8 


22 


9 


880 


4 


AAF30641 


Aaf 30641 Human hep 


7 


223.8 


22 


6 


756 


4 


AAD16358 


Aadl6358 Human sbg 
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8 


223 . 8 


22 


.6 


794 


12 


ACH917 08 


Ach91708 Human gen 


9 


154 


15 


. 6 


30191 


12 


ADQ9 76 54 


Adq97654 Mouse can 


10 


14 8 


14 


.9 


2779 


12 


ADJ7 5971 


Adj 75971 Marker ge 


11 


14 1 . 8 


14 


. 3 


110000 


14 


ADZ12814_2 


Continuation (3 of 


12 


138.4 


14 


.0 


1744 


15 


AEF3 62 79 


Aef36279 Hepatoma- 


13 


138 . 4 


14 


. 0 


247461 


13 


ABD3 3153 


Abd3 3153 Murine ca 


14 


137 . 8 


13 


. 9 


714 


2 


AATB8 42 0 


Aat88420 Mouse hep 


15 


137 . 8 


13 


. 9 


1563 


2 


AAV03 86 7 


Aav03 86 7 Mouse hep 


16 


135 


13 


. 6 


723 


2 


AAQ79 903 


Aaq79903 Human hep 


17 


13 5 


13 


. 6 


723 


10 


ADFl 02 08 


Adfl02 08 Human hep 


18 


13 5 


13 


. 6 


73 2 


3 


AAA4 0112 


Aaa40112 Human HDG 


19 


135 


13 


. 6 


834 


3 


AAA4 0117 


Aaa40117 Human HDG 


2 0 


135 


13 


. 6 


1386 


5 


AAF59 29 2 


Aaf59292 Human hHD 


2 1 


13 5 


13 


. 6 


2133 


3 


AAF18 04 9 


Aafl8 04 9 Lung cane 


22 


135 


13 


. 6 


2219 


12 


ADQ8 3188 


Adq8 3188 Human turn 


23 


135 


13 


. 6 


2219 


12 


ADQ8 43 69 


Adq843 69 Human turn 


24 


135 


13 


. 6 


2219 


13 


ADQ8 5101 


Adq8 5101 Human turn 


25 


135 


13 


. 6 


2219 


13 


ADQ8 3876 


Adq838 76 Human turn 


26 


135 


13 


. 6 


2219 


13 


ADQ8 62 91 


Adq862 91 Human tum 


27 


135 


13 


. 6 


2219 


13 


ACN3 74 80 


Acn3 74 80 Tumour -as 


28 


13 5 


13 


. 6 


2376 


2 


AAQ79 902 


Aaq79902 Human hep 


29 


135 


13 


. 6 


2376 


3 


AAA4 0118 


Aaa40118 Human HDG 


3 0 


135 


13 


. 6 


2376 


5 


ABA83 087 


Aba 83 087 Hepatoma- 


3 1 


135 


13 


. 6 


2376 


10 


ADFl 02 10 


Adfl0210 Human hep 


32 


135 


13 


. 6 


2376 


10 


ADK61214 


Adk61214 Ovarian c 


33 


135 


13 


.6 


2376 


12 


ADJ4 5234 


Adj 4 52 34 Human hep 


34 


135 


13 


.6 


2376 


13 


ADR2 5187 


Adr25187 Breast ca 


35 


135 


13 


. 6 


2376 


14 


ADY14323 


Adyl43 23 DNA encod 


36 


135 


13 


. 6 


2376 


14 


ADZ4 8952 


Adz48952 Insulin s 


37 


135 


13 


. 6 


2376 


15 


AEE60970 


Aee6 0970 Human pat 


38 


135 


13 


.6 


2388 


14 


AED7 34 24 


Aed734 24 Human pi a 


3 9 


133 . 4 


13 


.5 


2135 


10 


ADE3 1258 


Ade312 58 Human dia 


4 0 


132 


13 


. 3 


48 0 


9 


ACH45 23 2 


Acn45 23 2 Human roe 


41 


128 . 2 


12 


.9 


869 


2 


AAV3 915 4 


Aav3915 4 Lung grow 


42 


127 


12 


.8 


1926 


3 


AAA50254 


Aaa50254 Epstein B 


43 


12 7 


12 


.8 


1926 


4 


AAF82 902 


Aaf82 902 EBV tethe 


44 


127 


12 


.8 


1926 


10 


ADK6 55 80 


Adk655 80 Human her 


45 


127 


12 


.8 


1926 


14 


ADV6 8155 


Adv68155 Epstein-B 



ALIGNMENTS 



RESULT 1 
ADA37366 

ID ADA3 73 66 standard; cDNA; 9 90 BP. 
XX 

AC ADA3 73 66; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human hepatoma -derived growth factor 5 cDNA. 
XX 

KW ds; gene; cytostatic; antiinflammatory; gene therapy; human; 

KW hepatoma -derived growth factor 5; HDGF5; diagnosis; malignant tumour; 

KW surgical efficiency; liver cancer; hepatitis; inflammation. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 5. .913 
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FT /*tag= a 

FT /products "Human hepatoma -derived growth factor 5" 

XX 

PN WO2003 057883 -Al. 
XX 

PD 17-JUL-2003. 
XX 

PF 02-JAN-2003; 2 003WO-CN0 00 003 . 
XX 

PR ll-JAN-2002; 2 002CN- 001 1053 5 . 
XX 

PA (YULL/) YU L. 
XX 

PI Yu L, Tang L, Guo Z, Zhang P, Dong Y; 
XX 

DR WPI; 2003-587131/55. 
DR P-PSDB; ADA37367. 
XX 

PT Human hepatoma -derived growth factor 5 and encoded polynucleotide, 

PT applicable in early diagnosis of malignant tumor, and development of 

PT drugs for hepatitis and acute inflammatory in various organs. 

XX 

PS Claim 1; Page 10; 26pp; Chinese. 
XX 

CC The invention relates to an isolated DNA encoding a human hepatoma- 
CC derived growth factor 5 (HDGF5), or sequences that have at least 70 % 
CC homology to the HDGF5 sequence. The polynucleotide and its encoded 
CC polypeptide are applicable in early diagnosis of malignant tumours, 
CC monitoring surgical efficiency and prognosis of liver cancer, and 
CC development of drugs for hepatitis and acute inflammation in various 
CC organs. This sequence represents the human HDGF5 cDNA. 
XX 

SQ Sequence 990 BP; 221 A; 273 C; 354 G; 142 T; 0 U; 0 Other; 



Query Match 100.0%; Score 990; DB 9; Length 990; 

Best Local Similarity 100.0%; Pred. No. 7.9e-176; 

Matches 990; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


CGCTATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 


60 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIII 




Db 


1 


CGCTATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATT 


60 


Qy 


61 


AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 


12 0 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 




Db 


61 


AAAGGGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCA 


12 0 


Qy 


121 


GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 


180 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIII 




Db 


121 


GGTGTTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTA 


180 


Qy 


181 


TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 


24 0 




IIMIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIMMIIilllllll 




Db 


181 


TGAGGAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCT 


24 0 


Qy 


241 


GTGGGAGATCGAGCACGACCCTATGGCTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 


300 






MIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIII 




Db 


241 


GTGGGAGATCGAGCACGACCCTATGGCTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGA 


300 


Qy 


301 


GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 


360 




III II II II II II II II II II III II II II II II INI II II II II II II II II II II II 




Db 


301 


GCAGCTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGA 


360 
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Qy 3 61 GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 42 0 

lllllllillllllllllllMIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIII 

Db 3 61 GCCGGAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGA 42 0 

Qy 421 GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGGAGCCGGA 480 

IIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIMIII 

Db 4 21 GAAAGAGTGTGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAGGAGCCGGA 480 

Qy 4 81 GCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGAGCCGGA 54 0 

IIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 
Db 4 81 GCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCCGAGCCGGA 54 0 

Qy 541 GCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCTCATTGA 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 GCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGCCTCATTGA 60 0 

Qy 6 01 GGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAAGCTGAGGT 66 0 

IIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIilllllMIIIIIIIIIIIIIIIIII 

Db 6 01 GGCCGAGCCAGGAGATCAGCAAGCCGAGC7VAGTGCGAGAGCAGCACGCTGAAGCTGAGGT 660 

Qy 661 CATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACAGCCTCA 72 0 

I II II II II II II II II II II Ml II II II II II II II II II II II II II II II II II II 
Db 661 CATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAACAGCCTCA 72 0 

Qy 721 CAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCCGGCTGG 78 0 

IIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMM 

Db 721 CAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAGCCGGCTGG 78 0 

Qy 781 AGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGAACAGCA 84 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 AGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAGGAACAGCA 84 0 

Qy 841 GACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGGAGAAAT 90 0 

I II M II 1 1 I II I II I II II II I II I II II II I II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II II I II 1 1 

Db 841 GACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTTGGAGAAAT 900 

Qy 901 CGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGTGCCACC 96 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 901 CGAAGGCCTGTAGTCACGGTGTCTGTAAAAGAGCCCTCTCTACCCGTTCCTGGTGCCACC 96 0 

Qy 961 TGGCTGTGGCTTGGGAAACCCGCTAGGGCC 990 

IIIIIIIMIIIIIIIIIIIIIIIMIIII 

Db 961 TGGCTGTGGCTTGGGAAACCCGCTAGGGCC 9 90 



RESULT 2 

ADQ96711 

ID ADQ96711 standard; cDNA; 921 BP. 
XX 

AC ADQ96711; 
XX 

DT 23-SEP-2004 (first entry) 
XX 

DE Human Wolf -Hirschhorn syndrome -related protein (WHSRP) cDNA. 

XX 

KW Wolf -Hirschhorn syndrome -related protein; WHSRP; cell differentiation; 

KW development; human; ss; gene. 

XX 

OS Homo sapiens . 
XX 
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FH Key Location/Qualifiers 
FT CDS 1. .921 

FT /*tag= a 

FT /product = "Human Wolf -Hirschhorn syndrome -related protein 

FT (WHSRP)" 

XX 

PN CN1376796-A, 
XX 

PD 30-OCT-2002. 
XX 

PF 04-APR-2002; 2 002CN-' 001 1126 7 . 
XX 

PR 04-APR-2002; 2 002CN- 001 1126 7 . 
XX 

PA (UYFU-) UNIV FUDAN. 
XX 

PI Yu L, Tang L, She X; 
XX 

DR WPI; 2003-202161/20. 
DR P-PSDB; ADQ96712. 

XX 

PT Coding sequence of novel human protein and its preparing process and 

PT application. 

XX 

PS Claim 1; SEQ ID NO 1; 17pp; Chinese. 
XX 

CC The invention relates to a novel cDNA sequence for the human Wolf- 

CC Hirschhorn syndrome -related protein (hWHSRP) which is associated with 

CC human cell differentiation and development and the encoded polypeptide, 

CC in addition to a process for preparing and applying the said 

CC polynucleotide and polypeptide sequences. The current sequence is that of 

CC the human WHSRP cDNA of the invention. 

XX 

SQ Sequence 921 BP; 212 A; 248 C; 337 G; 124 T; 0 U; 0 Other; 

Query Match 89.4%; Score 885.4; DB 11; Length 921; 

Best Local Similarity 98.6%; Pred. No. 2.8e-156; 

Matches 908; Conservative 0; Mismatches 1; Indels 12; Gaps 1; 

ATGTCTTGCTTCAGCCGCCCAAAATACAAGACCGGGGACCTGGTGTTTGCCAAATTAAAG 64 

IIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIII 

ATGTCTTGCTT CAGCCG CC CAAT^TACAAG AC CGGGGACCTGGTGTTTGC CAAATT AAAG 6 0 

GGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCAGGTG 12 4 

Mllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GGCTATGCCCATTGGCCAGCGAGGATTGAACATGTCACTGAACCCAACCGCTACCAGGTG 12 0 

TTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTATGAG 184 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIII 

TTCTTCTTCGGGACCCATGAGACCGCCCTGCTGGGCCCCAAGCACCTTTTTCCTTATGAG 18 0 
185 GAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCTGTGG 24 4 

II II II II II II II II II II III II II II II II II II II II II II MM 11 II II II II 

181 GAGTCCAAGGAGAGGTTCGGCAAGCCTAACAAGAGGCGCGGCTTCAGTGAGGGGCTGTGG 24 0 

GAGATCGAGCACGACCCTATGGCTGAGGCCTCCCCTTGCCTGTGCCCAGATGAGGAGCAG 304 

II II II II I II II II M I I II I II II II I II I II I II I I II II II M II II II I II I II 
GAGATCGAGCACGACCCTATGGTTGAGGCCTCCCCTT6CCTGTGCCCAGATGAGGAGCAG 300 

CTTTGTGCCGAGGAGCC AGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGAGCCG 364 

I II II MM MM MM II Mill MM MM II MM MM II MINI MM MM II 



Qy 


5 


Db 


1 


Qy 


65 


Db 


61 


Qy 


125 


Db 


121 


Qy 


185 


Db 


181 


Qy 


245 


Db 


241 


Qy 


3 05 
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Db 3 01 CTTTGTGCCGAGGAGCCAGGGCCAGGAGAGGAGCCAGAGCCGGGGCAGGAGCTGGAGCCG 36 0 

Qy 365 GAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGAGAAA 424 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 GAATCCAGGCCTGAGCTGGAATCCATGCCTGAGCTGGAGGCAGAACCGAGGCCTGAGAAA 42 0 

Qy 425 GAGTGT -GAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAG 4 72 

mill IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 
Db 421 GAGTGTGAGCAGG'AGCCGGAGCAGGAGCCGGAGCAGGAGCCGGAGCAGGAGCTGGAGCAG 48 0 

Qy 4 73 GAGCCGGAGCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCC 53 2 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 4 81 GAGCCGGAGCTGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCGGAGCCCGAGCCC 54 0 

Qy 533 GAGCCGGAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGC 592 

I II II MM II MM II MM Mill II II II II Mil II MM II II II II II II II II 

Db 541 GAGCCGGAGCCGGAGCCCCAGCCTGCCTATGACCTACTGGATGCCAAGGAGGAGCCTGGC 600 

Qy 593 CTCATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAA 652 

IMIIIIIIIIIIIIMIIMIMIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIM 

Db 6 01 CTCATTGAGGCCGAGCCAGGAGATCAGCAAGCCGAGCAAGTGCGAGAGCAGCACGCTGAA 66 0 

Qy 653 GCTGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAA 712 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 661 GCTGAGGTCATGGCTGTAGTGGAGGAGCCGGAGAGTCTGAAGAGGAGCGCGGAGGATGAA 72 0 

Qy 713 CAGCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAG 772 

IIIIIMIIIIMIMIIIMIIIMIIIIIIIIIIMIIIIIIMIIMIIMMIIM 

Db 721 CAGCCTCACAGTCCTCCCAAACGGCCCAGGGAGGCGGCGCCTGGCGCGCTGGAGATGGAG 780 

Qy 7 73 CCGGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAG 83 2 

lllllllllllllllllirillllllllllllllllllMIIIIIIIIIIIIIIMIIM 

Db 781 CCGGCTGGAGAGCGCGAGGCAGAGGCCTGCCCCTTCGTGGAGGAGCCTGACCAAGCCCAG 84 0 

Qy 833 GAACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTT 892 

llllllllllll III IIIIIIIMMMIIIIIIII llllllllllllllllllllllll 

Db 841 GAACAGCAGACTCCGTTGGAAGAAGAGGCCACAGAGGAGGCAGTCCAGGGCCTGATGGTT 900 

Qy 8 93 GGAGAAATCGAAGGCCTGTAG 913 

lllllllllllllllllllll 
Db 9 01 GGAGAAATCGAAGGCCTGTAG 921 

< ! - - EndFragment - - > 
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